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JOINT RESOLUTION OR SHARPNESS ENHANCEMENT AND ARTIFACT REDUCTION FOR CODED 
DIGITAL VIDEO 

1 . Field of the Invention 

5 The present invention is a new approach for joint video enhancement and artifact 

reduction in order to achieve optimal picture quality for coded digital video. Video 
enhancement may include resolution enhancement or sharpness enhancement. More 
particularly, the present invention is a system and method that includes sharpness 
enhancement or resolution enhancement, artifact reduction, and a joint control to drive both 

10 post-processing units. Most particularly, the present invention provides a joint control that 
is based on a metric, such as the metric provided in the inventor's co-pending patent 
application entitled "A Unified Metric For Digital Video Processing (UMDVP)", the entire 
content of which is hereby incorporated by reference as if folly set forth herein, the metric 
being used to determine which pixel and by how much it is to be enhanced and to 

1 5 determine on which pixel and to what degree to carry out the artifact reduction. 

2. Description of the Related Art 

Moving Picture Expert Group (MPEG) video compression technology enables 
many current and emerging products, e.g., DVD players, high definition television 
decoders, and video conferencing, by requiring less storage and less bandwidth. 

20 Compression comes at the expense of a reduction in picture quality due to the introduction 
of artifacts. It is well known that such lossy compression technology (MPEG-1, MPEG-2, 
MPEG-4, H.26x, etc.) can cause the introduction of coding artifacts that decrease picture 
quality of the decoded video. In block-based coding techniques the most frequent artifacts 
are blockiness and ringing and numerous algorithms have been developed that address 

25 reduction of these various artifacts. While the common objective of these algorithms is to 
reduce the artifacts without decreasing any other desirable feature of the scene content 
(e.g., image sharpness and fine detail). In reality, the traditional sharpness enhancement 
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algorithms perform sub-optimally for encoded digital video, often enhancing coding 
artifacts already present, see inventors' co-pending patent application entitled " System and 
Method of Sharpness Enhancement for Coded Digital Video," the entire content of which 
is hereby incorporated by reference as if fully set forth herein. 

5 In another of the inventors* co-pending patent applications entitled "Unified Metric 

For Digital Video Processing," a metric for digital video processing is defined based on 
MPEG coding information and local spatial features. This metric determines how much a 
pixel can be enhanced without boosting coding artifacts. Experiments have shown that 
sharpness enhancement algorithms alone combined with this metric result in better picture 

10 quality than algorithms without it. However, the video still contains coding artifacts, that 
need to be removed to achieve optimal picture quality. 

Usually, in both de-blocking and de-ringing algorithms, the first step is to detect the 
artifacts, then the next steps apply artifact reduction on the localized area of the image 
having artifacts. If the artifact detection step is incorrect, resulting picture quality can be 

15 worse than before artifact reduction. Therefore, it is crucial to have reliable detection of 
coding artifacts. 

It is very likely that both resolution or sharpness enhancement algorithms and 
artifact reduction algorithms will co-exist in a system for receipt, storage, and further 
processing of a priori coded digital video, e.g., MPEG coded video. Current approaches to 
20 sharpness enhancement and artifact reduction are performed independently of one another 
and improvements resulting from one can impact the other negatively thereby decreasing 
picture quality. 

Thus, there is a need for a joint approach to resolution or sharpness enhancement 
(SE) and artifact reduction (AR) that identifies/differentiates coding artifacts and local fine 
25 details. The resolution enhancement consists of a scaling function and a sharpness 
enhancement algorithm. The present invention deals only with the sharpness enhancement 
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part of the resolution enhancement. The present invention is a unified approach for joint 
sharpness enhancement and artifact reduction that achieves optimal picture quality for 
coded digital video. 

Once a metric, such as UMDVP, identifies which pixel is a good candidate for 
5 enhancement and on which artifact removal has to be applied, the present invention 
provides a control to efficiently and reliably drive both SE and AR. In one embodiment, 
the present invention employs a metric to characterize the pixel containing coding artifacts 
such as blockiness and ringing. Then, based on the metric the control determines which 
and how many neighboring pixels are to be involved in AR without blurring relevant 
10 features such as edges and fine details in the vicinity of the original pixel. In addition, 
based on the metric the present invention determines how "aggressively" AR should be 
applied to a certain region or an individual pixel. 

In areas that are "artifact-free", the joint control of the present invention based on a 
unified metric can drive SE to enhance edges and textures in these areas such that the 
15 amount of enhancement applied can be controlled to achieve optimal picture quality. 

FIG. 1 illustrates a functional view of a post-processing system for a jointly 
controlling resolution or sharpness enhancement (SE) and artifact reduction (AR) of 
decoded video. 

FIG. 2 illustrates a flow diagram of the control level of the post-processing method 
20 of the present invention. 

FIG. 3 illustrates a flow diagram of an algorithm level UMDVP-controlled 
deringing of the luminance signal. 

FIG. 4 illustrates the neighborhood of UMDVP values of point (i j). 

FIG. 5 illustrates a flow diagram of an algorithm level UMDVP-controlled 
25 deringing of the chrominance signal. 
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Referring now to FIG. 1, an MPEG-2 decoder 10 decodes an input video signal and 
the decoded video signal 11 is input to the post-processing unit 18. The post-processing 
unit 18 comprises a sharpness enhancement module 12 or resolution enhancement module 
12a, and an artifact reduction module 13. 
5 The artifact reduction module 13 comprises at least one algorithm selected from the 

group comprising e.g. de-blocking , de-ringing etc.. 

A control module 17 uses a metric 19 to jointly control the application of post- 
processing to the decoded video signal 11 by the sharpness enhancement module 12 and 
the artifact reduction module 13. The control module 17 receives the metric 19 from a 
10 metric calculation module 16. The MPEG-2 decoder 10 sends coding information 15 as 
well the decoded video signal 11 to the metric calculation module 16. The output of the 
system is the post-processed video 14. 

Unified Metric for Digital Video Processing 

In a preferred embodiment, the metric calculation module 16 calculates "A Unified 
15 Metric for Digital Video Processing" (UMDVP), as described in the inventors' co-pending 
application of the same title. From the block-based coding information 15, the UMDVP 
metric is calculated and reflects the local picture quality of the MPEG-2 encoded video. 
The UMDVP is determined based on such block-based coding information as the 
quantization scale, number of bits spent to code a block, and picture type (I, P or B). Such 
20 coding information is obtained from the MPEG-2 bitstream for little computational cost. 
The coding information is sent by the decoder to the metric calculation module . The 
metric calculation module 16 can adapt the UMDVP to the local scene contents using local 
spatial features such as local variance. The spatial features are used to refine the metric to 
a pixel-based value to further improve the performance of the joint post-processing unit 18. 
25 The values of the UMDVP metric are in the range of [-1,1]. The lower the 

UMDVP value, the more likely the pixel is to have coding artifacts. In general, high 



WO 2004/054269 PCT/BB2003/005536 

positive UMDVP values indicate that the pixels should be sharpened and excluded from 
artifact reduction. The control module 17 receives the UMDVP metric 19 and uses this 
metric 19 to jointly control the sharpness enhancement module 12 and the artifact reduction 
module 13 of the post-processing unit 18. The value of metric 19 determines which of the 
5 post-processing modules is turned on, and in what order. For example, if the UMDVP 
metric 19 is smaller than a pre-determined threshold, VPJTHRED, sharpness enhancement 
module 12 is turned off and artifact reduction module 18 is turned on and if UMDVP 
metric is greater than or equal to the threshold VPJTHRED the artifact reduction module 1 8 
is turned off and the sharpness enhancement module 12 is turned on. It is not necessary to 
10 turn off one function completely. For example, if it is determined that AR has performed 
well at a region with artifacts, SE can be enabled to improve the sharpness in that region) 

While the UMDVP metric can indicate whether or to what degree to apply artifact 
reduction to a pixel, this metric does not provide a means to distinguish between different 
coding artifacts, such as blockiness or ringing. Thus, it is up to the artifact reduction 
15 module 13, once activated by the control module 17, to determine how to use the UMDVP 
metric to achieve a higher performance. For example, the value of the UMDVP metric 19 
determines how "aggressively" artifact reduction or sharpness enhancement should be 
performed. The lower the value of the UMDVP metric below the value of VPJTHRED the 
more artifact reduction the control unit 17 directs the artifact reduction unit 13 to perform. 
20 Otherwise, the larger the value of UMDVP is above VPJTHRED the more enhancement 
the sharpness enhancement module 12 is directed to perform by the control unit 17. 

The use of a metric in conjunction with VPJTHRED is illustrated in FIG. 2. The 
metric M = UMDVP is calculated from block-based coding information at step 20. The 
difference between the pre-determined threshold, VPJTHRED, and the calculated metric M 
25 is determined at step 21 using the equation 
AMT = M- VPJTHRED 

5 
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The value of AMT indicates how aggressively post-processing should be applied, i.e., in 
direct proportion to the absolute value of AMT. When AMT is positive, artifact reduction 
is turned off at step 24 and enhancement is turned on at step 25 with the amount of 
enhancement applied over a base level being in proportion to AMT, i.e., the aggressiveness 

5 of the enhancement. If AMT is not positive, i.e., 0 or negative, enhancement is turned off 
at step 22 and artifact reduction is turned on at step 23. Since the lower the value of M for 
a given block the more likely it is that a block has artifacts, more aggressive artifact 
reduction is performed when it is performed in proportion to |AMT|. 
Artifact Reduction Algorithms 

10 Many types of artifacts can be introduced by lossy encoding of a video signal and 

can be reduced using corresponding algorithms during post-processing by the post- 
processing unit 18. A metric, e.g., UMDVP, can be used to control when, where and how 
much post-processing is accomplished by these algorithms. 

Two types of artifacts that commonly occur in coded video streams are blockiness 

15 and ringing. Blockiness manifests itself as visible discontinuities at block boundaries due 
to the independent coding of adjacent blocks. Ringing is most evident along high contrast 
edges in areas of generally smooth texture and appears as ripples extending outwards from 
the edge. Ringing is caused by abrupt truncation of high frequency DCT components, 
which play significant roles in the representation of an edge. 

20 While blockiness and remedial de-blocking have been widely studied and many de- 

blocking algorithms have been developed, ringing has drawn less attention. In particular, 
satisfactory deringing algorithms for large high-contrast high-resolution displays are not 
present in the prior art and those that due exist are either based on simple spatial filtering 
resulting in a compromised picture quality, or their computational complexity prevents any 

25 implementation in the near-term. However, ringing artifacts can be even visible at higher 
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bit rates and are exaggerated on such displays as High-definition monitors, and are thus 
very annoying. 

Both de-blocking and de-ringing algorithms can be controlled by the system and 
method of the present invention. 
5 UMPVP-Controlled Deringing 

For purposes of example and not limitation, a deringing algorithm of the artifact 
reduction module is presented to illustrate how an appropriate metric can be used to control 
a post-processing algorithm. This deringing algorithm is based on adaptive spatial filtering 
and employs a metric, such as UMDVP, calculated by the metric calculation unit 16, to 

10 determine the location of the filtering (detection), the size of the filter, and which pixels are 
included or excluded in the filter window. Further, based on the value of the metric, the 
deringing algorithm adaptively determines how much a filtered pixel can differ from its 
original values, thus providing a control over the displacement that depends on the strength 
of the original compression. 

15 Ringing artifacts can occur in the chrominance components, resulting in colors that 

differ from the surrounding area, and, due to color sub-sampling, may spread through the 
entire macroblock. This problem is remedied by applying chrominance filtering in a 
carefully controlled way to prevent any color mismatch, 
a. Deringing for the Luminance Component 

20 By way of example and not limitation, a flow of the processing steps for a 

UMDVP-controlled deringing of the luminance signal is illustrated in FIG. 3. Consider a 
pixel located at position (ij), where the neighborhood of position (ij) is defined as 
illustrated in FIG. 4. At step 31 it is determined whether an isolated "0" UMDVP value is 
found in a neighborhood of UMDVP values of "1" and if so then UMDVP(ij) is set to 1, 

25 for the pixel at location (ij) at step 32. A neighborhood size is selected, in a preferred 
embodiment a 3x3 neighborhood of the pixel to be deringed, and at step 33 it is determined 
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whether all of the UMDVP values in this neighborhood are less than or equal to "0" or if 
the pixel is not in a homogeneous neighborhood but has a negative UMDVP value. The 
condition tested at step 33 prevents performing deringing on isolated points as well as 
excessive blurring in, e.g., texture areas, where the UMDVP values are a mix of "l"s and 
5 "0"s. 

If the condition at step 33 is satisfied, luminance values of the pixel are filtered by a 
first Filtering I at step 35, e.g., a low-pass filter using the chosen window size and which 
excludes the pixels which differ by more than a given amount, e.g., 10%, from the 
luminance value of the pixel being deringed. Thus, pixels with significantly different 
10 luminance values are excluded so that fine details are not filtered out instead of artifacts. 
Other types of filtering can be used besides low-pass, see Filtering II discussion which 
follows. 

If the condition at step 33 is not satisfied, luminance values of the pixel are filtered 
by a second Filtering II step 34, which also performs as a low-pass filter: 

15 

YJBlt(i j) = Y(i j) - AUMDVP(i j)) * hp(i j) 

where Y(i j) is the original luminance value, XUMDVP(i j)) is a function of UMDVP(i j) 
and hp(ij) is the high-pass signal. By way of example and not limitation, 
20 AUMDVP(i j)) = (l-UMDVP(i j))/a 

where "a" can be, e.g., 2, 4, 8, ... . For "a" = 4 the calculation of hp(ij) can be 
accomplished using the following filter kernel: 

0-10 

25 -1 4 -1 

0-10 
8 
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Then, the output of the filter kernel is multiplied hy 0.5 to prevent very strong low-pass 
filtering. 

After filtering, at step 36 the original values are replaced by the filtered one based 
on, for example and not in any limiting sense, the following definition of the maximum 
5 displacement: 

Max_displ = PARI + PAR2 if pixel (i j) belongs to homogenous area 
Max_displ=abs(UMDVP)*PARl +PAR2, if not homogenous and 
UMDVP(i j) <=0 
1 0 Max_displ = PAR2, otherwise, 

where, for example, PARI =30, PAR2 = 10. 

If the absolute difference of the original luminance value and the filtered value is greater 
than the Max_displ calculated above, then either the original value will be kept or shifted 
1 5 only by the Max_displ at step 36, that is, it is set to f(Y(i j), YjRlt(i j), Max_displ). b. 
Deringing of the Chrominance Components 

By way of example and not limitation, a flow of the processing steps for a 
UMDVP-controlled deringing of the chrominance signal is illustrated in FIG. 5. Consider 
a pixel located at position (ij). At step 51 it is determined whether an isolated "0" 

20 UMDVP value is found in a neighborhood of UMDVP values of "1" and if so then 
UMDVP(i j) is set to 1 for the pixel at location (i j) at step 52. A neighborhood size is 
selected, in a preferred embodiment a 7x7 neighborhood of the pixel to be deringed, and at 
step 53 it is determined whether pixel(ij) belongs to a homogeneous area. The reason a 
bigger window size is chosen is that in common digital video systems, the chrominance 

25 signal is sub-sampled with respect to the luminance signal, e.g., in a 422 color format the 
chrominance signal is sub-sampled by 2 horizontally. 
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If the condition at step 53 is satisfied, a low-pass filtering is applied at step 54 to the 
original chrominance values, i.e., in a 3x5 window to match with a 422 sub-sampling. In 
the filtering pixels are excluded for which chrominance values differ by more than a given 
amount, e.g., 10%, from the chrominance value of the pixel being deringed. Thus, pixels 
5 with significantly different chrominance values are excluded so that color mismatch is 
prevented. 

After filtering, the original values are replaced by the filtered ones at step 55 based 
on, for example and not in any limiting sense, the following definition of the maximum 
displacement for chrominance components: 

10 

Max_displ_chrom = (PARI + PAR2)/4 if pixel(ij) is belongs to homogenous area 
Max_displ_chrom = (abs(UMDVP) * PARI +PAR2)/4, if not homogenous and 

UMDVP(ij) <=0 
Max_displ_chrom = PAR2/4, otherwise, 

15 where, for example, PARI =30, PAR2 = 10. 

The max displacement for chrominance components is the same as the one used for 
luminance, except here it is scaled down by factor of 4 to prevent any color mismatch. The 
divisor is determined empirically. If the absolute difference of the original chrominance 
values and the filtered values is greater than the Max_displ_chrom calculated above, then 

20 either the original values will be kept or shifted only by the Max_displ_chrom at step 55, 
that is they are set to AU(i j), U_filt(i j), Max_dispLchrom) and y[V(i j), V_filt(i j), 
Max__displ_chrom). 

Sequential Control of Post-Processing 

Post-processing using a metric is accomplished serially, in another preferred 
25 embodiment Metric-controlled deringing (AR) followed by metric-controlled resolution 
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or sharpness enhancement (SE) is one such serial control that is possible using the control 
17. 

In view of this disclosure the various units and modules described herein can be 
implemented in either software or hardware or a combination of the two to achieve a 

5 desired performance level. Further, the post-processing algorithms and their parameters are 
included by way of example only and not in any limiting sense. Therefore, the 
embodiments described are illustrative of the principle of this invention for the use of a 
metric for the joint control of a plurality of post-processing algorithms as applied to coded 
digital video and are not intended to limit the invention to the specific embodiments 

10 described. In view of this disclosure, those skilled in the art can implement the apparatus 
comprising a control unit that uses a metric to control a post-processing unit for decoded 
digital video by determining the type, aggressiveness and order of post-processing 
algorithm application to the decoded digital video in a wide variety of ways. Further, the 
types of post-processing algorithms are not limited to those disclosed as examples and the 

15 post-processing algorithms themselves can make use of the metric in determining their own 
processing of the decoded digital video. 
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